STREPTOCOCCUS PNEUMONIAE PROTEINS AND
IMMUNOGENIC FRAGMENTS FOR VACCINES



This application is based on U.S. Provisional Application No. 
60/113,048, filed 21 December 1998, which is hereby incorporated in its 
entirety. 



FIELD OF THE INVENTION 



This invention relates generally to the field of bacterial antigens and their
use, for example, as immunogenic agents in humans and animals to stimulate
an immune response. More specifically, it relates to the vaccination of
mammalian species with a polypeptide comprising at least one conserved
histidine triad residue (HxxHxH) and at least one helix-forming polypeptide
obtained from Streptococcus pneumoniae as a mechanism for stimulating
production of antibodies that protect the vaccine recipient against infection by
a wide range of serotypes of pathogenic S. pneumoniae. Further, the
invention relates to antibodies against such polypeptides that are useful in
the diagnosis of, and passive immune therapy for, such pneumococcal
infections.



In a particular aspect, the present invention relates to the prevention and
treatment of pneumococcal infections such as infections of the middle ear,
nasopharynx, lung and bronchial areas, blood, CSF, and the like, that are
caused by pneumococcal bacteria.



BACKGROUND OF THE INVENTION



Streptococcus pneumoniae is a Gram-positive bacterium that is a major
causative agent of invasive infections in animals and humans, such as sepsis,
meningitis, otitis media and lobar pneumonia (Tuomanen et al., New Engl. J.
Med. 322:1280-1284 (1995)). As part of the infective process, pneumococci
readily bind to non-inflamed human epithelial cells of the upper and lower
respiratory tract by binding to eukaryotic carbohydrates in a lectin-like manner
(Cundell et al., Micro. Path. 17:361-374 (1994)). Conversion to invasive
pneumococcal infections for bound bacteria may involve the local generation of
inflammatory factors which may activate the epithelial cells to change the
number and type of receptors on their surface (Cundell et al., Nature,
377:435-438 (1995)). Apparently, one such receptor, platelet activating factor
(PAF), is engaged by the pneumococcal bacteria, and within a very short period
of time (minutes) from the appearance of PAF, pneumococci exhibit strongly
enhanced adherence and invasion of tissue. Certain soluble receptor analogs
have been shown to prevent the progression of pneumococcal infections
(Idanpaan-Heikkila et al., J. Inf. Dis., 176:704-712 (1997)). A number of
other proteins have been suggested as being involved in the
pathogenicity of S. pneumoniae. There remains a need for identifying
polypeptides having epitopes in common from various strains of S.
pneumoniae in order to utilize such polypeptides as vaccines to provide
protection against a wide variety of S. pneumoniae.


SUMMARY OF INVENTION 

In accordance with the present invention, there are provided vaccines and
vaccine compositions that include polypeptides obtained from S. pneumoniae
and/or variants of said polypeptides and/or active fragments of such
polypeptides.

The active fragments, as hereinafter defined, include a histidine triad
residue(s) and/or coiled coil regions of such polypeptides.

The term "percent identity" or "percent identical," when referring to a
sequence, means that a sequence is compared to a claimed or described
sequence from an alignment of the sequence to be compared (the "Compared
Sequence") with the described or claimed sequence (the "Reference
Sequence"). The percent identity is determined as follows:

Percent Identity = [1 - (C/R)] x 100

wherein C is the number of differences between the Reference Sequence and
the Compared Sequence over the length of the alignment between the
Compared Sequence and the Reference Sequence wherein (i) each base or
amino acid in the Reference Sequence that does not have an aligned base or
amino acid in the Compared Sequence and (ii) each gap in the Reference
Sequence and (iii) each aligned base or amino acid in the Reference Sequence
that is different from an aligned base or amino acid in the Compared Sequence,
each being a difference; and R is the number of bases or amino acids in the
Reference Sequence over the length of the alignment with the Compared
Sequence with any gap created in the Reference Sequence also being counted
as a base or amino acid.

If an alignment exists between the Compared Sequence and the
Reference Sequence in which the Percent Identity as calculated above is about
equal to or greater than a specified minimum Percent Identity, then the
Compared Sequence has the specified minimum Percent Identity to the
Reference Sequence even though alignments may exist in which the
hereinabove calculated Percent Identity is less than the specified Percent
Identity.
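
By way of illustration only, the Percent Identity calculation defined above may be sketched in Python as follows. The function name, the use of "-" as the gap character, and the short example sequences are assumptions made for this sketch and form no part of the definition; the alignment itself is assumed to have been produced beforehand by any suitable alignment method.

```python
def percent_identity(reference: str, compared: str, gap: str = "-") -> float:
    """Percent Identity = [1 - (C/R)] x 100 for two pre-aligned sequences.

    Both strings are rows of the same alignment, so they must have equal
    length; a gap is written with the `gap` character (assumed to be "-").
    """
    if len(reference) != len(compared):
        raise ValueError("aligned sequences must have the same length")
    if not reference:
        raise ValueError("alignment is empty")

    differences = 0  # C in the formula
    for ref_char, cmp_char in zip(reference, compared):
        if ref_char == gap and cmp_char == gap:
            raise ValueError("alignment column contains only gaps")
        # (i) no aligned residue in the Compared Sequence,
        # (ii) a gap in the Reference Sequence, or
        # (iii) two aligned residues that differ.
        if ref_char == gap or cmp_char == gap or ref_char != cmp_char:
            differences += 1

    # R: Reference Sequence positions over the alignment, with gaps created
    # in the Reference Sequence also counted, i.e. the number of columns.
    reference_length = len(reference)
    return (1 - differences / reference_length) * 100


# Hypothetical 11-column alignment with one mismatch and two gaps (C = 3).
print(round(percent_identity("HGDHYH-YNGK", "HGDHFHAYN-K"), 1))
```

Under this reading, a Compared Sequence satisfies a specified minimum (for example 90%) if at least one alignment yields a value at or about that minimum.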



"Isolated" in the context of the present invention with respect to 
polypeptides and/or polynucleotides means that the material is removed from 
its original environment (e.g., the natural environment if it is naturally 
occurring). For example, a naturally-occurring polynucleotide or polypeptide 
present in a living organism is not isolated, but the same polynucleotide or 
polypeptide, separated from some or all of the co-existing materials in the 
natural system, is isolated. Such polynucleotides could be part of a vector 
and/or such polynucleotides or polypeptides could be part of a composition, 
and still be isolated in that such vector or composition is not part of its natural 
environment. The polypeptides and polynucleotides of the present invention 
are preferably provided in an isolated form, and preferably are purified to 
homogeneity. 



BRIEF DESCRIPTION OF DRAWINGS 

Figures 1A-1C, respectively, report the results of three experiments 
using different preparations of SP36. The results demonstrate that active 
immunization with recombinant SP36 derived from pneumococcal strain 
Norway serotype 4 is able to protect mice from death in a model of 
pneumococcal sepsis using a heterologous strain, SJ2 (serotype 6B). In each 
of the three experiments shown, one hundred percent of the mice immunized 
with SP36 survived for the 14-day observation period following challenge
with approximately 500 cfu of pneumococci, while eighty to one hundred 
percent of sham-immunized mice (injected with PBS and adjuvant) died 
during the same period. 


Figures 2A-2B show that passive administration of rabbit antiserum
raised against Sp36 derived from Norway type 4 was able to protect mice in
the pneumococcal sepsis model using two heterologous strains. Figure 2A
shows that one hundred percent of the mice immunized with the Sp36
antiserum survived the 21-day observation period after challenge with 172
CFU of strain SJ2 (serotype 6B). Eighty percent of the mice immunized
with a control serum (rabbit anti-FimC) died by day 8, and ninety percent
died by day 12. Figure 2B shows that ninety percent of the mice immunized
with the Sp36 antiserum survived the 8-day observation period after challenge
with 862 CFU of strain EF6796 (serotype 6A). Ninety percent of the mice
immunized with a control serum (collected before immunization) died by day
5.

Figure 3 is a western blot demonstrating the ability of antisera raised
against recombinant Sp36 derived from strain Norway type 4 to react with
Sp36 of heterologous strains. Total cell lysates were immunoblotted with
mouse antisera to Sp36. A band representing Sp36 protein was detected in
all 23 S. pneumoniae strains tested, which included isolates from each of the
23 pneumococcal serotypes represented in the current polysaccharide
vaccine.

Figure 4 is a Southern blot showing that the Sp36 gene from Norway 
type 4 hybridizes with genomic DNA from 24 other pneumococcal strains, 
indicating the presence of similar sequences in all these strains. 

Figure 5 is a western blot showing the reactivity of patient sera with
Sp36. Sp36 (either full-length, panel A; N-terminal half, panel B; or
C-terminal half, panel C) was electrophoresed by SDS-PAGE and transferred to
nitrocellulose. Patient sera collected soon after the onset of illness (acute
serum, lanes A) or eight to 30 days later (convalescent serum, lanes C) were
used to probe the blots. For patients 2, 3, and 5, convalescent serum
reacted more strongly with Sp36 than did the corresponding acute serum.

Figure 6 is an amino acid alignment comparison of four related
pneumococcal proteins, namely Sp36A (PhtA; SEQ ID NO:8), Sp36B (PhtB;
SEQ ID NO:10), Sp36D (PhtD; SEQ ID NO:4), and Sp36E (PhtE; SEQ ID NO:6),
respectively. Dashes in a sequence indicate gaps introduced to maximize the
sequence similarity. Amino acid residues that match are boxed.


Figure 7 is a nucleotide alignment comparison of four related
pneumococcal genes, namely Sp36A (PhtA; SEQ ID NO:9), Sp36B (PhtB;
SEQ ID NO:11), Sp36D (PhtD; SEQ ID NO:5), and Sp36E (PhtE; SEQ ID NO:7),
respectively. Dashes in a sequence indicate gaps introduced to maximize the
sequence similarity.

Figure 8 shows the results of immunization of mice with PhtD
recombinant protein, which leads to protection from lethal sepsis. C3H/HeJ
(Panels A and B) or Balb/cByJ (Panel C) mice were immunized subcutaneously
with PhtD protein (15 µg in 50 µl PBS emulsified in 50 µl complete Freund's
adjuvant (CFA)). The recombinant PhtD protein used in protection
experiments consisted of 819 amino acid residues, starting with the cysteine
(residue 20). A group of 10 sham-immunized mice received PBS with
adjuvant. A second immunization of 15 µg protein with incomplete Freund's
adjuvant (IFA) was administered 3 weeks later; the sham group received PBS
with IFA. Blood was drawn (retro-orbital bleed) at week 7, and sera from
each group were pooled for analysis of anti-PhtD antibody by ELISA. Mice
were challenged at week 8 by an intraperitoneal (i.p.) injection of
approximately 550 CFU S. pneumoniae strain SJ2, serotype 6B (Panel A),
850 CFU of strain EF6796, serotype 6A (Panel B) or 450 CFU of strain
EF5668, serotype 4 (Panel C). In preliminary experiments, the LD50 for
strains SJ2 and EF6796 was determined to be approximately 10 CFU for both
strains. The LD50 for strain EF5668 was determined to be < 5 CFU. Survival
was determined in all groups over the course of 15 days following challenge.
Data are presented as the percent survival for a total of 10 mice per
experimental group. A two-sample log-rank test was used for statistical
analysis comparing recombinant Pht immunized mice to sham-immunized
mice.

SUMMARY OF THE INVENTION 


In accordance with one aspect of the present invention, there is
provided a vaccine, generally in the form of a composition, that includes at
least one polypeptide that is at least 90% identical to (i) a polypeptide
sequence comprising amino acids 1-819 of SEQ ID NO:4 or (ii) a polypeptide
sequence comprising amino acids 1-460 of SEQ ID NO:6 or an active fragment
of the foregoing.

In accordance with another aspect of the present invention, there is
provided a vaccine, generally in the form of a composition, that includes an
active fragment of a polypeptide that is at least 90% identical to (i) a
polypeptide comprising amino acids 1-800 of SEQ ID NO:8 or (ii) a polypeptide
comprising amino acids 1-800 of SEQ ID NO:10.


The term "active fragment" means a fragment that includes one or more 
histidine triad residues and/or one or more coiled coil regions. A "histidine 
triad residue" is the portion of the polypeptide that has the sequence HxxHxH 
wherein H is histidine and x is an amino acid other than histidine.
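
As a non-limiting sketch, the HxxHxH pattern as defined above (H being histidine and x being any amino acid other than histidine) can be located in a one-letter amino acid sequence with a regular expression. The function name and the ten-residue test peptide below are hypothetical and are used only for illustration; positions are reported 1-based and inclusive, in the same style as the residue ranges used in this description.

```python
import re

# H = histidine; x = any standard one-letter residue code other than H.
# The lookahead lets the scan report motifs that share a histidine.
HISTIDINE_TRIAD = re.compile(r"(?=(H[A-GI-Z]{2}H[A-GI-Z]H))")


def find_histidine_triads(sequence: str) -> list[tuple[int, int, str]]:
    """Return (start, end, motif) for every HxxHxH match, 1-based inclusive."""
    hits = []
    for match in HISTIDINE_TRIAD.finditer(sequence.upper()):
        start = match.start() + 1        # convert 0-based index to 1-based
        hits.append((start, start + 5, match.group(1)))  # motif spans 6 residues
    return hits


# Hypothetical peptide, not taken from any SEQ ID NO of this application.
print(find_histidine_triads("MKHGDHYHAA"))
```

A run of consecutive histidines, such as a hexa-histidine purification tag, does not match, because each x position excludes histidine.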


A coiled coil region is the region predicted by the "Coils" algorithm: Lupas,
A., Van Dyke, M., and Stock, J. (1991) Predicting Coiled Coils from Protein
Sequences, Science 252:1162-1164.

In accordance with one embodiment, the active fragment includes both
one or more histidine triad residues and at least one coiled coil region of the
applicable polypeptide sequence. In accordance with another embodiment, the
active fragment includes at least two histidine triad residues.

In another embodiment, the active fragment that includes at least one
histidine triad residue or at least one coiled-coil region of the applicable
polypeptide includes at least about ten percent of the applicable polypeptide
and no more than about 85% of the applicable polypeptide.
The polypeptide of SEQ ID NO:4 includes five histidine triad residues, as
follows:

amino acids 64-69; 188-193; 296-301; 541-546; and 625-630.

The polypeptide of SEQ ID NO:6 includes five histidine triad residues, as
follows:

amino acids 63-68; 185-190; 289-294; 376-381; and 441-446.

In addition, the polypeptide of SEQ ID NO:4 includes two coiled-coil
regions (amino acids 120-140 and amino acids 750-772) and the polypeptide
of SEQ ID NO:6 includes one coiled-coil region (amino acids 119-152).

The polypeptide of SEQ ID NO:8 includes the following regions:

HxxHxH: amino acids 63-68, 189-194, 309-314, 550-555, 634-639.
Coiled-coils: amino acids 118-145, 406-434, 462-493, 724-751.
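
For illustration, the region coordinates listed above can be held in a simple lookup table and used to check which regions a candidate fragment spans. This is a sketch only: the dictionary layout and function names are hypothetical, and treating "includes" as full containment of a region within the fragment is an assumption of this example rather than a definition taken from the description.

```python
# Region coordinates (1-based, inclusive) as listed in the description.
REGIONS = {
    "SEQ ID NO:4": {
        "histidine_triads": [(64, 69), (188, 193), (296, 301), (541, 546), (625, 630)],
        "coiled_coils": [(120, 140), (750, 772)],
    },
    "SEQ ID NO:6": {
        "histidine_triads": [(63, 68), (185, 190), (289, 294), (376, 381), (441, 446)],
        "coiled_coils": [(119, 152)],
    },
    "SEQ ID NO:8": {
        "histidine_triads": [(63, 68), (189, 194), (309, 314), (550, 555), (634, 639)],
        "coiled_coils": [(118, 145), (406, 434), (462, 493), (724, 751)],
    },
}


def regions_in_fragment(seq_id: str, start: int, end: int) -> dict:
    """Return the listed regions lying entirely within residues start..end."""
    return {
        kind: [(s, e) for (s, e) in spans if start <= s and e <= end]
        for kind, spans in REGIONS[seq_id].items()
    }


def is_active_fragment(seq_id: str, start: int, end: int) -> bool:
    """True if the fragment spans at least one triad or coiled-coil region."""
    found = regions_in_fragment(seq_id, start, end)
    return bool(found["histidine_triads"] or found["coiled_coils"])


# Hypothetical fragment covering residues 50-200 of SEQ ID NO:4.
print(regions_in_fragment("SEQ ID NO:4", 50, 200))
```

In this example the fragment spans two histidine triads and one coiled-coil region, so it would also satisfy the embodiment requiring both kinds of region.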

In accordance with a further aspect of the invention, a vaccine of the
type hereinabove described is administered for the purpose of preventing or
treating infection caused by S. pneumoniae.

A vaccine, or vaccine composition, in accordance with the present
invention may include one or more of the hereinabove described polypeptides
or active fragments thereof. When employing more than one polypeptide or
active fragment, such two or more polypeptides and/or active fragments may
be used as a physical mixture or as a fusion of two or more polypeptides or
active fragments. The fusion fragment or fusion polypeptide may be produced,
for example, by recombinant techniques or by the use of appropriate linkers for
fusing previously prepared polypeptides or active fragments.

In an embodiment of the invention, there is provided (a) a polypeptide
that is at least 95% identical or at least 97% identical or 100% identical to (i)
a polypeptide sequence comprising amino acids 1 to 819 of SEQ ID NO:4 or
(ii) a polypeptide sequence comprising amino acids 1-460 of SEQ ID NO:6; or
(b) an active fragment of the polypeptide of (a).

In the case where the polypeptide is a variant of the polypeptide
comprising the mature polypeptide of SEQ ID NO:4 or SEQ ID NO:6, or any of
the active fragments of the invention, the variation in the polypeptide or
fragment is generally in a portion thereof other than the histidine triad residues
and the coiled-coil region, although variations in one or more of these regions
may be made.

In many cases, the variation in the polypeptide or active fragment is a 
conservative amino acid substitution, although other substitutions are within 
the scope of the invention. 


In accordance with the present invention, a polypeptide variant includes 
variants in which one or more amino acids are substituted and/or deleted 
and/or inserted. 

In another aspect, the invention relates to passive immunity vaccines
formulated from antibodies against a polypeptide or active fragment of a
polypeptide of the present invention. Such passive immunity vaccines can be
utilized to prevent and/or treat pneumococcal infections in patients. In this
manner, according to a further aspect of the invention, a vaccine can be
produced from a synthetic or recombinant polypeptide of the present invention
or an antibody against such polypeptide.

In still another aspect, the present invention relates to a method of using
one or more antibodies (monoclonal, polyclonal or sera) to the polypeptides of
the invention as described above for the prophylaxis and/or treatment of
diseases that are caused by pneumococcal bacteria. In particular, the
invention relates to a method for the prophylaxis and/or treatment of infectious
diseases that are caused by S. pneumoniae. In a still further preferred aspect,
the invention relates to a method for the prophylaxis and/or treatment of otitis
media, nasopharyngeal and bronchial infections, and the like in humans by
utilizing a vaccine of the present invention.

Generally, vaccines are prepared as injectables, in the form of aqueous
solutions or suspensions. Vaccines in an oil base, such as those for
inhalation, are also well known. Solid forms which are dissolved or suspended
prior to use may also be formulated. Pharmaceutical carriers are generally
added that are compatible with the active ingredients and acceptable for
pharmaceutical use. Examples of such carriers include, but are not limited to,
water, saline solutions, dextrose, or glycerol. Combinations of carriers may
also be used.

Vaccine compositions may further incorporate additional substances to 
stabilize pH, or to function as adjuvants, wetting agents, or emulsifying 
agents, which can serve to improve the effectiveness of the vaccine. 


Vaccines are generally formulated for parenteral administration and are
injected either subcutaneously or intramuscularly. Such vaccines can also be 
formulated as suppositories or for oral administration, using methods known in 
the art. 
The amount of vaccine sufficient to confer immunity to pathogenic
bacteria is determined by methods well known to those skilled in the art. This
quantity will be determined based upon the characteristics of the vaccine
recipient and the level of immunity required. Typically, the amount of vaccine
to be administered will be determined based upon the judgment of a skilled
physician. Where vaccines are administered by subcutaneous or intramuscular
injection, a range of 50 to 500 ng purified protein may be given.

The present invention is also directed to a vaccine in which a
polypeptide or active fragment of the present invention is delivered or
administered in the form of a polynucleotide encoding the polypeptide or active
fragment, whereby the polypeptide or active fragment is produced in vivo.
The polynucleotide may be included in a suitable expression vector and
combined with a pharmaceutically acceptable carrier.

In addition, the polypeptides of the present invention can be used as
immunogens to stimulate the production of antibodies for use in passive
immunotherapy, for use as diagnostic reagents, and for use as reagents in
other processes such as affinity chromatography.

In another aspect, the present invention provides polynucleotides which
encode the hereinabove described polypeptides and active fragments of the
invention. The polynucleotide of the present invention may be in the form of
RNA or in the form of DNA, which DNA includes cDNA, genomic DNA, and
synthetic DNA. The DNA may be double-stranded or single-stranded, and if
single-stranded may be the coding strand or non-coding (anti-sense) strand.
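
The relationship between a coding strand and its non-coding (anti-sense) strand can be illustrated with a short Python sketch. The function name and the nine-base example are hypothetical; only the four unambiguous DNA bases are handled, which is an assumption of this example.

```python
# Watson-Crick complements for the four DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the anti-sense strand, written 5' to 3', for a coding strand."""
    return dna.translate(_COMPLEMENT)[::-1]


# Hypothetical coding-strand fragment (codons CAT GGT GAC encode His-Gly-Asp).
print(reverse_complement("CATGGTGAC"))  # GTCACCATG
```

Applying the function twice returns the original strand, which is a convenient sanity check on any such conversion.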

In accordance with another aspect of the present invention, there is
provided

(A) an isolated polynucleotide that is at least 90% identical to a
polynucleotide sequence encoding (i) a polypeptide comprising amino acids
1-819 of SEQ ID NO:4 or (ii) a polypeptide comprising amino acids 1-460 of
SEQ ID NO:6, or

(B) a fragment of the polynucleotide of (A) that encodes an active
polypeptide fragment, or

(C) a polynucleotide that is at least 90% identical to a
polynucleotide sequence encoding an active fragment of (i) a polypeptide
comprising amino acids 1 to 800 of SEQ ID NO:8 or (ii) a polypeptide
comprising amino acids 1 to 800 of SEQ ID NO:10.

In specific embodiments, the polynucleotide is at least 95% identical,
preferably at least 97% identical, and even 100% identical to such
polynucleotide sequence.

The term "polynucleotide encoding a polypeptide" encompasses a
polynucleotide which includes only coding sequence for the polypeptide as well
as a polynucleotide which includes additional coding and/or non-coding
sequence.

The present invention further relates to variants of polynucleotides. The
variants of the polynucleotides may be a naturally occurring allelic variant of
the polynucleotides or a non-naturally occurring variant of the polynucleotides.
The variants include variants in which one or more bases are substituted,
deleted or inserted. Complements to such coding polynucleotides may be
utilized to isolate polynucleotides encoding the same or similar polypeptides.
In particular, such procedures are useful to obtain native immunogenic portions
of polypeptides from different serotypes of S. pneumoniae, which is especially
useful in the production of "chain" polypeptide vaccines containing multiple
immunogenic segments.

SEQ ID NO:5 is a representative example of a polynucleotide encoding
the polypeptide of SEQ ID NO:4 and SEQ ID NO:7 is a representative example
of a polynucleotide encoding the polypeptide of SEQ ID NO:6. SEQ ID NO:9 is
a representative example of a polynucleotide encoding the polypeptide of SEQ
ID NO:8, and SEQ ID NO:11 is a representative example of a polynucleotide
encoding the polypeptide of SEQ ID NO:10. As a result of the known
degeneracy of the genetic code, other polynucleotides that encode the
polypeptides of SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO:10
should be apparent to those skilled in the art from the teachings herein.
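
The extent of this degeneracy can be illustrated by counting how many distinct coding sequences encode a given peptide. The following sketch assumes the standard genetic code (61 sense codons); the table and function names, and the six-residue example peptide, are hypothetical and chosen only for illustration.

```python
from math import prod

# Number of sense codons per amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_coding_sequences(peptide: str) -> int:
    """Count the distinct nucleotide sequences that encode `peptide`."""
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())


# Hypothetical six-residue peptide matching the HxxHxH pattern:
# H(2) x G(4) x D(2) x H(2) x Y(2) x H(2) = 128 possible coding sequences.
print(count_coding_sequences("HGDHYH"))  # 128
```

Because the count is a product over residues, it grows rapidly with length, which is why a polypeptide of several hundred residues has a very large number of equivalent coding polynucleotides.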

The polynucleotides encoding the immunogenic polypeptides described 
above may also have the coding sequence fused in frame to a marker 
sequence which allows for purification of the polypeptides of the present 
invention. The marker sequence may be, for example, a hexa-histidine tag
supplied by a pQE-9 vector to provide for purification of the mature 
polypeptides fused to the marker in the case of a bacterial host, or, for 
example, the marker sequence may be a hemagglutinin (HA) tag when a 
mammalian host, e.g. COS-7 cells, is used. The HA tag corresponds to an 
epitope derived from the influenza hemagglutinin protein (Wilson, I., et al., Cell, 
37:767 (1984)). 

The present invention also relates to vectors which include 
polynucleotides encoding one or more of the polypeptides of the invention, 
host cells which are genetically engineered with vectors of the invention and 
the production of such immunogenic polypeptides by recombinant techniques 
in an isolated and substantially immunogenically pure form. 

Host cells are genetically engineered (transduced or transformed or
transfected) with the vectors comprising a polynucleotide encoding a
polypeptide of the invention. The vector may be, for example, in the form of a
plasmid, a viral particle, a phage, etc. The engineered host cells can be
cultured in conventional nutrient media modified as appropriate for activating
promoters, selecting transformants or amplifying the polynucleotides which
encode such polypeptides. The culture conditions, such as temperature, pH
and the like, are those previously used with the host cell selected for
expression, and will be apparent to the ordinarily skilled artisan.

Vectors include chromosomal, nonchromosomal and synthetic DNA
sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA;
baculovirus; yeast plasmids; vectors derived from combinations of plasmids
and phage DNA; and viral DNA such as vaccinia, adenovirus, fowl pox virus,
and pseudorabies. However, any other vector may be used as long as it is
replicable and viable in the host.

The appropriate DNA sequence may be inserted into the vector by a
variety of procedures. In general, the DNA sequence is inserted into an
appropriate restriction endonuclease site(s) by procedures known in the art.
Such procedures and others are deemed to be within the scope of those skilled
in the art.

The DNA sequence in the expression vector is operatively linked to an
appropriate expression control sequence(s) (promoter) to direct mRNA
synthesis. As representative examples of such promoters, there may be
mentioned: LTR or SV40 promoter, the E. coli lac or trp, the phage lambda PL
promoter and other promoters known to control expression of genes in
prokaryotic or eukaryotic cells or their viruses. The expression vector also
contains a ribosome binding site for translation initiation and a transcription
terminator. The vector may also include appropriate sequences for amplifying
expression.



In addition, the expression vectors preferably contain one or more
selectable marker genes to provide a phenotypic trait for selection of
transformed host cells such as dihydrofolate reductase or neomycin resistance
for eukaryotic cell culture, or such as tetracycline or ampicillin resistance in
E. coli.



The vector containing the appropriate DNA sequence as hereinabove
described, as well as an appropriate promoter or control sequence, may be
employed to transform an appropriate host to permit the host to express the
proteins.

As representative examples of appropriate hosts, there may be
mentioned: bacterial cells, such as E. coli, Streptomyces, Salmonella
typhimurium; fungal cells, such as yeast; insect cells such as Drosophila S2
and Spodoptera Sf9; animal cells such as CHO, COS or Bowes melanoma;
adenoviruses; plant cells, etc. The selection of an appropriate host is deemed
to be within the scope of those skilled in the art from the teachings herein.

More particularly, the present invention also includes recombinant
constructs comprising one or more of the sequences as broadly described
above. The constructs comprise a vector, such as a plasmid or viral vector,
into which a sequence of the invention has been inserted, in a forward or
reverse orientation. In a preferred aspect of this embodiment, the construct
further comprises regulatory sequences, including, for example, a promoter,
operably linked to the sequence. Large numbers of suitable vectors and
promoters are known to those of skill in the art, and are commercially
available. The following vectors are provided by way of example. Bacterial:
pQE70, pQE60, pQE-9 (Qiagen, Inc.), pbs, pD10, phagescript, psiX174,
pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene);
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic:
pWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene); pSVK3, pBPV, pMSG,
pSVL (Pharmacia). However, any other plasmid or vector may be used as long
as it is replicable and viable in the host.


Promoter regions can be selected from any desired gene using CAT
(chloramphenicol transferase) vectors or other vectors with selectable markers.
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial
promoters include lacI, lacZ, T3, T7, gpt, lambda PR, PL and trp. Eukaryotic
promoters include CMV immediate early, HSV thymidine kinase, early and late
SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the
appropriate vector and promoter is well within the level of ordinary skill in the
art.

In a further embodiment, the present invention relates to host cells
containing the above-described constructs. The host cell can be a higher
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-Dextran mediated transfection, or
electroporation (Davis, L., Dibner, M., Battey, I., Basic Methods in Molecular
Biology, (1986)).

The constructs in host cells can be used in a conventional manner to 
produce the gene product encoded by the recombinant sequence.
Alternatively, the polypeptides of the invention can be synthetically produced 
by conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria,
or other cells under the control of appropriate promoters. Cell-free translation
systems can also be employed to produce such proteins using RNAs derived
from the DNA constructs of the present invention. Appropriate cloning and
expression vectors for use with prokaryotic and eukaryotic hosts are described
by Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition,
Cold Spring Harbor, N.Y., (1989), the disclosure of which is hereby
incorporated by reference.

Transcription of the DNA encoding the polypeptides of the present
invention by higher eukaryotes is increased by inserting an enhancer sequence
into the vector. Enhancers are cis-acting elements of DNA, usually from about
10 to 300 bp, that act on a promoter to increase its transcription. Examples
include the SV40 enhancer on the late side of the replication origin (bp 100 to
270), a cytomegalovirus early promoter enhancer, the polyoma enhancer on the
late side of the replication origin, and adenovirus enhancers.

Generally, recombinant expression vectors will include origins of
replication and selectable markers permitting transformation of the host cell,
e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and
a promoter derived from a highly-expressed gene to direct transcription of a
downstream structural sequence. Such promoters can be derived from operons
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK),
α-factor, acid phosphatase, or heat shock proteins, among others. The
heterologous structural sequence is assembled in appropriate phase with
translation initiation and termination sequences. Optionally, the heterologous
sequence can encode a fusion protein including an N-terminal identification
peptide imparting desired characteristics, e.g., stabilization or simplified
purification of expressed recombinant product.


Useful expression vectors for bacterial use are constructed by inserting
a structural DNA sequence encoding a desired protein together with suitable
translation initiation and termination signals in operable reading phase with a
functional promoter. The vector will comprise one or more phenotypic
selectable markers and an origin of replication to ensure maintenance of the
vector and to, if desirable, provide amplification within the host. Suitable
prokaryotic hosts for transformation include E. coli, Bacillus subtilis,
Salmonella typhimurium and various species within the genera Pseudomonas,
Streptomyces, and Staphylococcus, although others may also be employed as
a matter of choice.

As a representative but nonlimiting example, useful expression vectors
for bacterial use can comprise a selectable marker and bacterial origin of
replication derived from commercially available plasmids comprising genetic
elements of the well known cloning vector pBR322 (ATCC 37017). Such
commercial vectors include, for example, pKK223-3 (Pharmacia Fine
Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI, USA).
These pBR322 "backbone" sections are combined with an appropriate
promoter and the structural sequence to be expressed.


Following transformation of a suitable host strain and growth of the
host strain to an appropriate cell density, the selected promoter is induced by
appropriate means (e.g., temperature shift or chemical induction) and cells are
cultured for an additional period.

Cells are typically harvested by centrifugation, disrupted by physical or
chemical means, and the resulting crude extract retained for further
purification.


Microbial cells employed in expression of proteins can be disrupted by
any convenient method, including freeze-thaw cycling, sonication, a French
press, mechanical disruption, or use of cell lysing agents; such methods are
well known to those skilled in the art. However, preferred are host cells which
secrete the polypeptide of the invention and permit recovery of the polypeptide
from the culture media.

Various mammalian cell culture systems can also be employed to
express recombinant protein. Examples of mammalian expression systems
include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman,
Cell, 23:175 (1981), and other cell lines capable of expressing a compatible
vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines.
Mammalian expression vectors will comprise an origin of replication, a suitable
promoter and enhancer, and also any necessary ribosome binding sites,
polyadenylation site, splice donor and acceptor sites, transcriptional
termination sequences, and 5' flanking nontranscribed sequences. DNA
sequences derived from the SV40 splice and polyadenylation sites may be
used to provide the required nontranscribed genetic elements.

WO 00/37105 PCT/US99/30390

The polypeptides can be recovered and/or purified from recombinant cell
cultures by well-known protein recovery and purification methods. Such
methodology may include ammonium sulfate or ethanol precipitation, acid
extraction, anion or cation exchange chromatography, phosphocellulose
chromatography, hydrophobic interaction chromatography, affinity
chromatography, hydroxylapatite chromatography and lectin chromatography.
Protein refolding steps can be used, as necessary, in completing configuration
of the mature protein. In this respect, chaperones may be used in such a
refolding procedure. Finally, high performance liquid chromatography (HPLC)
can be employed for final purification steps.

The polypeptides that are useful as immunogens in the present invention 
may be a naturally purified product, or a product of chemical synthetic 
procedures, or produced by recombinant techniques from a prokaryotic or 
eukaryotic host (for example, by bacterial, yeast, higher plant, insect and
mammalian cells in culture). Depending upon the host employed in a 
recombinant production procedure, the polypeptides of the present invention 
may be glycosylated or may be non-glycosylated. 

The individually expressed polypeptides may be isolated by recombinant
expression/isolation methods that are well known in the art. Typical
examples of such isolation may utilize an antibody to a conserved area of
the protein or to a His tag or cleavable leader or tail that is expressed
as part of the protein structure.


The polypeptides, their fragments or other derivatives, or analogs 
thereof, or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or 
monoclonal antibodies. The present invention also includes chimeric, single 
chain, and humanized antibodies, as well as Fab fragments, or the product of
an Fab expression library. Various procedures known in the art may be used 
for the production of such antibodies and fragments. 

Antibodies generated against the polypeptides corresponding to a
sequence of the present invention can be obtained by direct injection of the
polypeptides into an animal.

For preparation of monoclonal antibodies, any technique which provides
antibodies produced by continuous cell line cultures can be used. Examples
include the hybridoma technique (Kohler and Milstein, 1975, Nature, 256:495-497),
the trioma technique, the human B-cell hybridoma technique (Kozbor et
al., 1983, Immunology Today 4:72), and the EBV-hybridoma technique to
produce human monoclonal antibodies (Cole, et al., 1985, in Monoclonal
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).

Techniques described for the production of single chain antibodies (U.S.
Patent 4,946,778) can be adapted to produce single chain antibodies to
immunogenic polypeptide products of this invention. Also, transgenic mice
may be used to express humanized antibodies to immunogenic polypeptide
products of this invention.

The invention will be further described with respect to the following 
examples; however, the scope of the invention is not limited thereby: 

Example 1

Active Protection with Anti-Sp36 

A. Cloning, expression, and purification of Sp36

The genomic DNA used as target for amplification was isolated from
S. pneumoniae Norway strain (serotype 4), the same strain used for genomic
sequencing. The complete sequence of the Sp36 gene (SEQ ID NO:9), and
its predicted amino acid sequence (SEQ ID NO:8), are given in the Sequence
Listing appended hereto. It was noted that the predicted amino acid
sequence included a hydrophobic leader sequence followed by a sequence
(LSVC) similar to the consensus sequence for Type II signal peptidase (LxxC,
in which both x's typically represent small amino acids). Primers (listed as
SEQ ID NOS:1-3) were designed that would amplify the Sp36 gene and allow
its cloning into pQE10 and expression as a histidine-tagged protein lacking
the signal sequence for purification by nickel-affinity chromatography.
Cloning of the fragment amplified by SEQ ID NOS:1 and 3 would result in a
protein containing amino acids 2 through 800 of Sp36; cloning of the
fragment amplified by SEQ ID NOS:2 and 3 would result in a protein
containing amino acids 7 through 800 of Sp36 (amino acid numbers refer to
SEQ ID NO:8).
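
The LxxC consensus mentioned above can be located in a protein sequence with a simple pattern search. The following is a minimal sketch, not the method used in this work: the function name find_lxxc and the example sequence are hypothetical, and the pattern accepts any residue at the two x positions rather than only small amino acids.

```python
import re

# L, any two residues, then C. The lookahead lets overlapping
# candidates be reported as well.
LXXC = re.compile(r"(?=(L..C))")


def find_lxxc(protein):
    """Return (0-based start, matched text) for every L-x-x-C occurrence."""
    return [(m.start(), m.group(1)) for m in LXXC.finditer(protein)]


# Hypothetical N-terminal fragment invented for illustration only;
# it is NOT taken from SEQ ID NO:8.
example = "MKKIYLLGSALLLSVCSYELG"

for start, motif in find_lxxc(example):
    # Report 1-based residue positions, as used in the text.
    print(f"{motif} at residues {start + 1}-{start + 4}")
```

A hit of this kind only flags a candidate site; whether the site is actually processed by Type II signal peptidase would still have to be established experimentally.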

B. Active Protection With Sp36 Vaccination 

In each of the three experiments shown in Figures 1A-1C, C3H/HeJ
mice (10/group) were immunized intraperitoneally (i.p.) with Sp36 protein
(15 µg in 50 µl PBS emulsified in 50 µl complete Freund's adjuvant (CFA)). A
group of 10 sham-immunized mice received PBS with adjuvant. A second
immunization of 15 µg protein with incomplete Freund's adjuvant (IFA) was
administered 4 weeks later; the sham group received PBS with IFA. Blood
was drawn (retro-orbital bleed) at weeks 3, 6, and 9, and sera from each
group were pooled for analysis of anti-Sp36 antibody by ELISA. Mice were
challenged at week 10 by an i.p. injection of approximately 500 CFU of S.
pneumoniae strain SJ2 (serotype 6B; provided by P. Flynn, St. Jude
Children's Research Hospital, Memphis, TN). In preliminary experiments, the
LD50 of this strain was determined to be approximately 10 CFU. Mice were
monitored for 14 days for survival.

The three experiments shown in Figures 1A-1C used slightly different
preparations of recombinant Sp36. The experiments shown in Figures 1A and
1B both used Sp36 containing amino acids 20-815, but different batches of
protein were used in the two experiments. The experiment shown in Figure
1C used Sp36 containing amino acids 25-815.


In the experiment shown in Figure 1A, 9-week sera collected from the 
ten mice immunized with Sp36 (first batch) had an endpoint ELISA titer of 
1:4,096,000. No anti-Sp36 antibody was detected in sera from sham- 
immunized mice. One hundred percent of the mice immunized with Sp36 
protein survived the challenge (520 cfu of pneumococci) for 14 days. Eighty
percent of sham-immunized mice were dead by day 4, and the remainder 
survived. 



In the experiment shown in Figure 1B, 9-week sera collected from the
ten mice immunized with Sp36 (second batch) had an endpoint ELISA titer of
>1:4,096,000. No anti-Sp36 antibody was detected in sera from sham-
immunized mice. One hundred percent of the mice immunized with Sp36
protein survived the challenge (510 cfu of pneumococci) for 14 days. Of the
sham-immunized mice, eighty percent were dead by day 4, and all died by
day 9.

In the experiment shown in Figure 1C, 9-week sera collected from the
ten mice immunized with Sp36 (containing amino acids 25-815) had an
endpoint ELISA titer of 1:4,096,000. No anti-Sp36 antibody was detected in
sera from sham-immunized mice. One hundred percent of the mice
immunized with Sp36 protein survived the challenge (510 cfu of
pneumococci) for 14 days. Of the sham-immunized mice, ninety percent died
by day 4, and all died by day 12. These data demonstrate that immunization
of mice with recombinant Sp36 proteins elicits a response capable of
protecting against systemic pneumococcal infection and death. This
protection was not strain-specific: the recombinant pneumococcal protein
was cloned from a serotype 4 strain, while the challenge was with a
heterologous strain, SJ2 (serotype 6B).
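
To put the challenge doses in this example in perspective, they can be expressed as multiples of the approximate LD50 of strain SJ2 (10 CFU) stated in section B. The short sketch below only restates that arithmetic; the names LD50_CFU, challenge_cfu, and ld50_multiple are illustrative and do not appear in the original.

```python
# Approximate LD50 of strain SJ2 as stated in the text (CFU).
LD50_CFU = 10

# Challenge doses reported for the three experiments (CFU).
challenge_cfu = {"Figure 1A": 520, "Figure 1B": 510, "Figure 1C": 510}


def ld50_multiple(dose_cfu, ld50_cfu=LD50_CFU):
    """Express a challenge dose as a multiple of the LD50."""
    return dose_cfu / ld50_cfu


multiples = {fig: ld50_multiple(dose) for fig, dose in challenge_cfu.items()}

for fig, multiple in multiples.items():
    print(f"{fig}: about {multiple:.0f} x LD50")
```

Because the LD50 itself is given only approximately, these multiples should be read as rough estimates, on the order of fifty times the LD50 in each experiment.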

Example 2 

Passive Protection with Anti-Sp36 Antisera 



A. Generation of Rabbit Immune Sera



Following collection of preimmune serum, a New Zealand White rabbit
was immunized with 250 µg of Sp36 (containing amino acids 20-815) in
CFA. The rabbit was given two boosts of 125 µg Sp36 in IFA on days 29
and 50 and bled on days 39 and 60. A second rabbit was immunized with a
control antigen, E. coli FimC.



B. Passive Protection in Mice 



C3H/HeJ mice (10 mice/group) were passively immunized by two i.p.
injections of 100 µl of rabbit serum. The first injection was administered
twenty-four hours before challenge with 172 cfu of S. pneumoniae strain
SJ2, and the second injection was given four hours after challenge. Figure 2
shows the survival of mice after infection with two different strains of
pneumococci.



Figure 2A shows that, of the mice injected with 172 cfu of strain SJ2,
one hundred percent of those immunized with rabbit immune
serum raised against Sp36 protein survived the 21-day observation period.
Of the mice immunized with the control serum (anti-FimC), eighty percent
died by day 8, and ninety percent died by day 12. Figure 2B shows that, of
the mice injected with 862 cfu of strain EF6796, ninety percent of those
immunized with rabbit immune serum raised against Sp36 protein survived
the 8-day observation period. Of those given a control serum (collected from
a rabbit before immunization), ninety percent died by day 8.

These data indicate that the protection against pneumococcal infection 
resulting from immunization with Sp36 is antibody-mediated, since mice can 
be protected by passive transfer of serum from a hyperimmunized rabbit. As 
seen in the mouse active challenge experiments described above, serum
directed against recombinant Sp36 protein cloned from a serotype 4 strain 
was protective against challenge with heterologous strains. 

Example 3 

Conservation of Sp36 Among Strains of S. pneumoniae

A. Western blotting 

The 23 pneumococcal strains used in this experiment were obtained
from the American Type Culture Collection (Rockville, MD) and include one
isolate each of the 23 serotypes in the multivalent pneumococcal vaccine.
For total cell lysates, pneumococci were grown to mid-logarithmic phase
(optical density at 620 nm, 0.4 to 0.6) in 2 ml Todd-Hewitt broth with 0.5%
yeast extract (Difco, Detroit, MI) at 37°C. Bacteria were harvested by
centrifugation and washed twice with water. Pellets were resuspended in
200 µl lysis buffer (0.01% sodium dodecyl sulfate, 0.15 M sodium citrate
and 0.1% sodium deoxycholate) and incubated at 37°C for 30 min, then
diluted in an equal volume of 2x SSC (0.3 M sodium chloride, 0.03 M sodium
citrate). Lysates were separated by SDS-PAGE, transferred to nitrocellulose
membranes (Bio-Rad Laboratories, Hercules, CA), and probed with antibody
in a standard Western blotting procedure. Sera from ten C3H/HeJ mice
immunized with Sp36 (as described in Example 1) were pooled and used at a
dilution of 1:3000. Bound antibody was detected with peroxidase-conjugated
sheep anti-mouse IgG using the chemiluminescence kit from
Amersham, Inc. (Cambridge, MA).
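
The lysate preparation described above combines 200 µl of lysis buffer with an equal volume of 2x SSC, so the final concentration of each component follows from a simple volume-weighted average. The sketch below works through that arithmetic under the assumption that the cell pellet contributes negligible volume; the function name mix and the dictionary keys are illustrative, not taken from the original.

```python
def mix(components_a, volume_a, components_b, volume_b):
    """Final concentration of each component after combining two solutions."""
    total = volume_a + volume_b
    names = set(components_a) | set(components_b)
    return {
        name: (components_a.get(name, 0.0) * volume_a
               + components_b.get(name, 0.0) * volume_b) / total
        for name in names
    }


# Concentrations as stated in the text
# (percent for detergents, mol/L for salts).
lysis_buffer = {
    "SDS_percent": 0.01,
    "sodium_citrate_M": 0.15,
    "deoxycholate_percent": 0.1,
}
ssc_2x = {"sodium_chloride_M": 0.3, "sodium_citrate_M": 0.03}

# Equal volumes, in microliters (pellet volume assumed negligible).
final = mix(lysis_buffer, 200.0, ssc_2x, 200.0)

for name in sorted(final):
    print(f"{name}: {final[name]:.3f}")
```

Under these assumptions the lysate loaded for SDS-PAGE would contain roughly 0.15 M sodium chloride and 0.09 M sodium citrate, with both detergents at half their starting concentration.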

The mouse anti-Sp36 sera detected two major bands with apparent 
molecular weights of 97 and 100 kDa in all 23 pneumococcal lysates tested 
(shown in Figure 3). The Sp36 signals obtained from S. pneumoniae
serotypes 1, 5, 17F and 22F were lower, indicating either that the level of 
Sp36 expression is reduced in these strains, or that Sp36 in these strains is 
antigenically different. 

These data show that Sp36 is antigenically conserved among strains
of the 23 pneumococcal serotypes represented in the current polysaccharide
vaccine.

B. Southern blotting

Genomic DNA was prepared from each of the 23 pneumococcal
strains listed in the previous section and also from strain SJ2. DNA was
digested with PvuII and BamHI, electrophoresed in an agarose gel and
transferred to a nylon membrane. A probe was prepared by amplifying the
Sp36 gene from Norway type 4 DNA (as in Example 1) and labeling the
amplified fragment with fluorescein by the random-priming method, using a
kit from Amersham. Hybridization, washing, and exposure of film were
carried out as in the protocol supplied by Amersham. Figure 4 shows that
the Sp36 probe hybridized with DNA from each of the 24 strains studied.
The lane marked "M" contained DNA from lambda phage, digested with
HindIII and labeled with fluorescein, as molecular weight markers.

Example 4

Immunogenicity of Sp36 in Humans 

In order to determine whether Sp36 is immunogenic during human
pneumococcal infection, sera from patients with culture-proven
pneumococcal bacteremia were used in Western blots containing
recombinant Sp36 protein. In the experiment shown in Figure 5, sera from
five patients (indicated as 1 through 5) were diluted 1:3000 and used to
probe blots containing full-length Sp36, the N-terminal half of Sp36
(preceding the proline-rich region), or the C-terminal half of Sp36 (following
the proline-rich region). Lanes labeled A (acute) were probed with serum
collected shortly after diagnosis of pneumococcal infection; lanes C
(convalescent) were probed with serum collected either one month later
(patients 1, 2, and 3) or eight days after the first serum collection (patients 4
and 5). For patients 2, 3 and 5, reactivity of the convalescent serum with
Sp36 was stronger than that of the corresponding acute serum. The
difference between the acute and convalescent sera was particularly evident
for reactivity with the C-terminal half of the protein.

In additional experiments (not shown), convalescent sera from 23
patients with pneumococcal infections were tested individually for reactivity
with full-length Sp36: 20 of the 23 sera were found to bind Sp36 on a
Western blot.

These experiments indicate that Sp36 is recognized by the human
immune system and suggest that antibodies able to bind the Sp36 protein
may be produced during natural S. pneumoniae infection in humans. Since
the patients were infected with a variety of pneumococcal strains, these data
also support the idea that Sp36 is antigenically conserved.

Example 5 



Table 1 provides the percent identity between the various sequences.

Alignment of the predicted amino acid sequences of PhtA, PhtB, PhtD,
and PhtE using the MEGALIGN program of Lasergene showed strong
N-terminal homology with substantial divergence of the C-termini (Figure 6).
The alignment of the nucleotide sequences of the same genes is shown in
Figure 7. Amino acid and nucleotide sequences were compared using the
identity weighting in a Lipman-Pearson pairwise alignment, in which the
number of matching residues is divided by the total of matching residues plus
the number of mismatched residues plus the number of residues in gaps. In
Table 1, the percent identity between each pair of sequences is
shown at the intersection of the corresponding row and column.
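
The identity calculation described above (matches divided by matches plus mismatches plus residues in gaps) can be sketched for a pair of already-aligned sequences as follows. This is an illustration of the stated ratio only, not a reimplementation of the Lasergene program: the function name percent_identity and the example fragments are hypothetical, and a residue opposite a gap character is counted as one "residue in a gap", which is an assumed reading of the text.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two aligned sequences of equal length.

    identity = matches / (matches + mismatches + gap residues) * 100
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    matches = mismatches = gap_residues = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # column is empty in both rows; nothing to count
        if a == gap or b == gap:
            gap_residues += 1  # one residue aligned against a gap
        elif a == b:
            matches += 1
        else:
            mismatches += 1

    total = matches + mismatches + gap_residues
    return 100.0 * matches / total if total else 0.0


# Hypothetical aligned fragments, invented for illustration only.
row_1 = "HGDHYHF-IP"
row_2 = "HGNHYHFAIP"
print(f"{percent_identity(row_1, row_2):.1f}% identity")
```

In this toy pair there are eight matches, one mismatch, and one residue opposite a gap, so the ratio is 8/10.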


Example 6 

Active Protection with PhtD Vaccination. 

Mice immunized with recombinant PhtD derived from strain N4
generated potent antibody titers (reciprocal endpoint titers ranging from
2,048,000 to 4,096,000). Mice immunized with PhtD were protected against
death following intraperitoneal injection with any of three heterologous
strains, SJ2 (serotype 6B; provided by P. Flynn, St. Jude Children's Research
Hospital, Memphis, TN), EF6796 (serotype 6A) or EF5668 (serotype 4; both
strains provided by D. Briles, University of Alabama, Birmingham). In the
experiment shown in Figure 8 (Panel A), all ten of the sham-immunized mice
died within 10 days after challenge with virulent pneumococci (strain SJ2),
while eighty percent of the PhtD-immunized mice survived the 15-day
observation period. Immunization with PhtD also protected against a serotype
6A strain, EF6796 (Panel B) and a serotype 4 strain, EF5668 (Panel C). In
the experiment shown in Figure 8 (Panel B), all ten of the sham-immunized
mice died within 7 days after challenge with virulent pneumococci (strain
EF6796), while ninety percent of the PhtD-immunized mice survived the
15-day observation period. In the experiment shown in Figure 8 (Panel C), all ten
of the sham-immunized mice died within 6 days after challenge with virulent
pneumococci (strain EF5668), while eight of nine mice immunized with PhtD
survived the 15-day observation period.

Table 1. Percent Identities

Percent Identity Between Amino Acid Sequences

        PhtA    PhtB    PhtD    PhtE
PhtA    -       66.4    63.9    49.5
PhtB            -       87.2    49.5
PhtD                    -       49.8
PhtE                            -

Percent Identity Between Nucleotide Sequences

        PhtA    PhtB    PhtD    PhtE
PhtA    -       58.3    59.3    47.9
PhtB            -       86.4    47.4
PhtD                    -       47.9
PhtE                            -
1. A vaccine composition comprising:

(a) at least one member selected from the group consisting
of (i) a polypeptide comprising a polypeptide sequence that is at least 90%
identical to amino acids 1-819 of SEQ ID NO:4; (ii) a polypeptide comprising
a polypeptide sequence that is at least 90% identical to amino acids 1-460
of SEQ ID NO:6; (iii) a fragment of the polypeptide of (i) that includes at least
one of a histidine triad residue or coiled-coil region; (iv) a fragment of the
polypeptide of (ii) that includes at least one of a histidine triad residue or a
coiled-coil region; (v) a fragment of a polypeptide that is at least 90%
identical to the polypeptide sequence comprising amino acids 1-800 of SEQ
ID NO:8, wherein said fragment includes at least one of a histidine triad
residue or coiled-coil region wherein said fragment includes at least 80 amino
acids and no more than 680 amino acids; and (vi) a fragment of a
polypeptide that is at least 90% identical to the polypeptide sequence
comprising amino acids 1-800 of SEQ ID NO:10, wherein said fragment
includes at least one of a histidine triad residue or coiled-coil region wherein
said fragment includes at least 80 amino acids and no more than 680 amino
acids; and

(b) a pharmaceutically acceptable carrier.

2. A process for preventing infection caused by S. pneumoniae 
comprising: 

administering the vaccine of claim 1.

3. A vaccine composition comprising:

(a) at least one antibody against a member selected from the
group consisting of (i) a polypeptide comprising a polypeptide sequence that
is at least 90% identical to amino acids 1-819 of SEQ ID NO:4; (ii) a
polypeptide comprising a polypeptide sequence that is at least 90% identical
to amino acids 1-460 of SEQ ID NO:6; (iii) a fragment of the polypeptide of
(i) that includes at least one of a histidine triad residue or coiled-coil region; (iv)
a fragment of the polypeptide of (ii) that includes at least one of a histidine
triad residue or a coiled-coil region; (v) a fragment of a polypeptide that is at
least 90% identical to the polypeptide sequence comprising amino acids
1-800 of SEQ ID NO:8, wherein said fragment includes at least one of a
histidine triad residue or coiled-coil region wherein said fragment includes at
least 80 amino acids and no more than 680 amino acids; and (vi) a fragment
of a polypeptide that is at least 90% identical to the polypeptide sequence
comprising amino acids 1-800 of SEQ ID NO:10, wherein said fragment
includes at least one of a histidine triad residue or coiled-coil region wherein
said fragment includes at least 80 amino acids and no more than 680 amino
acids.
Figure 1

Figure 2

A. Strain SJ2 (serotype 6B)
B. Strain EF6796 (serotype 6A)

Figure 6 (a)

Figure 6 (b)
[Nucleotide sequence alignment of phtA, phtB, phtD and phtE with the majority consensus, ending at alignment position 1050]
Figure 7 (d)

[Nucleotide sequence alignment of phtA, phtB, phtD and phtE with the majority consensus, alignment positions 1060-1400]
Figure 7 (e)

[Nucleotide sequence alignment of phtA, phtB, phtD and phtE with the majority consensus, alignment positions 1610-1750]
Figure 7 (f)

[Nucleotide sequence alignment of phtA, phtB, phtD and phtE with the majority consensus, alignment positions 1760-2100]
Figure 7 (g)

[Nucleotide sequence alignment of phtA, phtB, phtD and phtE with the majority consensus, alignment positions 2110-2450]
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Figure 7 (h) 



CACAAAGACTATTOGCTTTGT'rAAAOGAGAGTAJtXT-AAGGT C TT Majority 

3460 2470 2480 2490 2500 

2384 CAGAAAAATTACTTQCGTTOTTAAAAGGAAGTAATC C T T ' phtA SEC 

2363 CTCAAAAACTATTGGCTTTATTAAAGGAGAGTAAGTAAAGGTAGAAGCTT phtB.seq 

2408 TAGATAGTCTCTTGOCTTTOTTAAAAGAAAGT c phtD^SEQ 

1409 -AGAAGGACT- - TGAC AGAAGAGCAAATTAAGOT phtE.SEQ 

**•>-. 

AA--CCO--TCTOOC-CCTA-O-CAA- AA-A-T--TATGGXAAAAGCTXA Majority 



2510 2520 2530 2540 



2550 



1 f ' 1 , 

2453 CA TCTG TAAO TAAGG AAA AAAT - - - phtA.SEQ 

2413 AAGGGCOAATTTOaCACCCAOCACAACAATACTATTATCGCAGAAGCTGA phtB.seq 

2441 A A CCOOCTCCTA TATAOTAAAAGCTTA phtD.SSQ 

1440 OCG CAAAAACATT TAG phtT.SEQ 



AAAACTAXX 



2445 - AAACTAA 
2463 AAAACTATT 

2468 AO CC 

1455 



Majority 



phtA.SEQ 
phtB.seq 
phtO . SSQ 
pfatt.SEQ 
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Figure 8 

A. Strain SJ2 (serotype 6B)

B. Strain EF6796 (serotype 6A)

C. Strain EF5668 (serotype 4)

[Each panel: x-axis, days post-challenge (1 to 15)]
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SEQUENCE LISTING 

<110> Johnson, Leslie S. 
Koenig, Scott 
Adamou, John E. 

<120> Streptococcus Pneumoniae and Immunogenic Fragments for
Vaccines 

<130> 469201-444 

<140> 
<141> 

<150> 60/113,048 
<151> 1998-12-21 

<160> 11 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Forward primer
used in amplification of the Sp36 gene sequence.

<400> 1

atcggatcct tcttacgagt tgggactgta tcaagc 36

<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Forward primer
used in amplification of the Sp36 gene sequence. 

<400> 2 

atcggatcca ctgtatcaag ctagaacggt taagg 35 

<210> 3
<211> 40
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Reverse primer
used in amplification of the Sp36 gene sequence.

<400> 3

agtcaagctt gtttattttt tccttactta cagatgaagg 40

<210> 4 
<211> 838 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 4 

Met Lys Ile Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Val Leu Ala
1 5 10 15

Leu Ser Val Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val
20 25 30

Lys Lys Glu Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly
35 40 45

Gln Lys Ala Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly
50 55 60

Ile Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95

Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110

Leu Lys Asp Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile
115 120 125

Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140

Asp Asn Ile Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His
145 150 155 160

Ser His Asn His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala
165 170 175

Arg Ala Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala
180 185 190

Ser Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly
195 200 205

Asp His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu
210 215 220

Ala Ala Ala Glu Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser
225 230 235 240

Ser Ser Ser Ser Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu
245 250 255

Asn His Asn Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu
260 265 270

Asn Ile Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu
275 280 285

Arg His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr
290 295 300

Ser Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His
305 310 315 320

Phe Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg
325 330 335

Ile Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg
340 345 350

Pro Glu Gln Pro Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Pro
355 360 365

Gln Pro Ala Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu
370 375 380

Lys Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe
385 390 395 400

Glu Glu Asn Gly Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala
405 410 415

Glu Thr Ala Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu
420 425 430

Ser His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg
435 440 445

Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp
450 455 460

Leu Leu Asp Asn Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn
465 470 475 480

Leu Leu Glu Arg Leu Lys Asp Val Pro Ser Asp Lys Val Lys Leu Val
485 490 495

Asp Asp Ile Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu
500 505 510

Gly Lys Pro Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val
515 520 525

Ala Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp
530 535 540

Pro Arg Asp Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His
545 550 555 560

Met Thr His Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu
565 570 575

Arg Ala Ala Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro
580 585 590

Ser Thr Asp His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu
595 600 605

Ala Ile Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg
610 615 620

Met Pro Tyr Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu
625 630 635 640

Ile Ile Pro His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe
645 650 655

Asp Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu
660 665 670

Leu Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His
675 680 685

Ser Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn Lys
690 695 700

Val Asp Gln Asp Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu Val
705 710 715 720

Ser Glu Pro Thr His Pro Glu Ser Asp Glu Lys Glu Asn His Ala Gly
725 730 735

Leu Asn Pro Ser Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu
740 745 750

Glu Thr Glu Glu Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile Pro
755 760 765

Gln Val Glu Asn Ser Val Ile Asn Ala Lys Ile Ala Asp Ala Glu Ala
770 775 780

Leu Leu Glu Lys Val Thr Asp Pro Ser Ile Arg Gln Asn Ala Met Glu
785 790 795 800

Thr Leu Thr Gly Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn
805 810 815

Asn Thr Ile Ser Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu
820 825 830

Ser Gln Pro Ala Pro Ile
835



<210> 5 
<211> 2531 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 5 

atgaaaatta ataaaaaata tctagcaggt tcagtggcag tccttgccct aagtgtttgt 60 
tcctatgaac ttggtcgtca ccaagctggt caggttaaga aagagtctaa tcgagtttct 120 
tatatagatg gtgatcaggc tggtcaaaag gcagaaaact tgacaccaga tgaagtcagt 180 
aagagggagg ggatcaacgc cgaacaaatc gtcatcaaga ttacggatca aggttatgtg 240
acctctcatg gagaccatta tcattactat aatggcaagg tcccttatga tgccatcatc 300
agtgaagagc tcctcatgaa agatccgaat tatcagttga aggattcaga cattgtcaat 360
gaaatcaagg gtggttatgt tatcaaggta gatggaaaat actatgttta ccttaaggat 420 
gcagctcatg cggataatat tcggacaaaa gaagagatta aacgtcagaa gcaggaacac 480
agtcataatc acgggggtgg ttctaacgat caagcagtag ttgcagccag agcccaagga 540
cgctatacaa cggatgatgg ttatatcttc aatgcatctg atatcattga ggacacgggt 600
gatgcttata tcgttcctca cggcgaccat taccattaca ttcctaagaa tgagttatca 660
gctagcgagt tagctgctgc agaagcctat tggaatggga agcagggatc tcgtccttct 720
tcaagttcta gttataatgc aaatccagct caaccaagat tgtcagagaa ccacaatctg 780 
actgtcactc caacttatca tcaaaatcaa ggggaaaaca tttcaagcct tttacgtgaa 840
ttgtatgcta aacccttatc agaacgccat gtggaatctg atggccttat tttcgaccca 900 
gcgcaaatca caagtcgaac cgccagaggt gtagctgtcc ctcatggtaa ccattaccac 960 
tttatccctt atgaacaaat gtctgaattg gaaaaacgaa ttgctcgtat tattcccctt 1020 
cgttatcgtt caaaccattg ggtaccagat tcaagaccag aacaaccaag tccacaatcg 1080 
actccggaac ctagtccaag tccgcaacct gcaccaaatc ctcaaccagc tccaagcaat 1140 
ccaattgatg agaaattggt caaagaagct gttcgaaaag taggcgatgg ttatgtcttt 1200 
gaggagaatg gagtttctcg ttatatccca gccaaggatc tttcagcaga aacagcagca 1260 
ggcattgata gcaaactggc caagcaggaa agtttatctc ataagctagg agctaagaaa 1320 
actgacctcc catctagtga tcgagaattt tacaataagg cttatgactt actagcaaga 1380
attcaccaag atttacttga taataaaggt cgacaagttg attttgaggc tttggataac 1440 
ctgttggaac gactcaagga tgtcccaagt gataaagtca agttagtgga tgatattctt 1500 
gccttcttag ctccgattcg tcatccagaa cgtttaggaa aaccaaatgc gcaaattacc 1560 
tacactgatg atgagattca agtagccaag ttggcaggca agtacacaac agaagacggt 1620 
tatatctttg atcctcgtga tataaccagt gatgaggggg atgcctatgt aactccacat 1680 
atgacccata gccactggat taaaaaagat agtttgtctg aagctgagag agcggcagcc 1740 
caggcttatg ctaaagagaa aggtttgacc cctccttcga cagaccatca ggattcagga 1800 
aatactgagg caaaaggagc agaagctatc tacaaccgcg tgaaagcagc taagaaggtg 1860 
ccacttgatc gtatgcctta caatcttcaa tatactgtag aagtcaaaaa cggtagttta 1920 
atcatacctc attatgacca ttaccataac atcaaatttg agtggtttga cgaaggcctt 1980 
tatgaggcac ctaaggggta tactcttgag gatcttttgg cgactgtcaa gtactatgtc 2040 
gaacatccaa acgaacgtcc gcattcagat aatggttttg gtaacgctag cgaccatgtt 2100 
cgtaaaaata aggtagacca agacagtaaa cctgatgaag ataaggaaca tgatgaagta 2160
agtgagccaa ctcaccctga atctgatgaa aaagagaatc acgctggttt aaatccttca 2220 
gcagataatc tttataaacc aagcactgat acggaagaga cagaggaaga agctgaagat 2280 
accacagatg aggctgaaat tcctcaagta gagaattctg ttattaacgc taagatagca 2340 
gatgcggagg ccttgctaga aaaagtaaca gatcctagta ttagacaaaa tgctatggag 2400 
acattgactg gtctaaaaag tagtcttctt ctcggaacga aagataataa cactatttca 2460 
gcagaagtag atagtctctt ggctttgtta aaagaaagtc aaccggctcc tatatagtaa 2520 
aagcttaagc c 2531 

<210> 6 

<211> 484 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 6 

Met Lys Phe Ser Lys Lys Tyr Ile Ala Ala Gly Ser Ala Val Ile Val
1 5 10 15

Ser Leu Ser Leu Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu
20 25 30

Asn Lys Asp Asn Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser
35 40 45

Gln Lys Ser Glu Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly
50 55 60

Ile Gln Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95

Asp Ala Leu Phe Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110

Leu Lys Asp Ala Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile
115 120 125

Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140

Asp Asn Val Arg Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His
145 150 155 160

Val Lys Asp Asn Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser
165 170 175

Gln Gly Arg Tyr Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp
180 185 190

Ile Ile Glu Asp Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His
195 200 205

Tyr His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala
210 215 220

Ala Lys Ala His Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser
225 230 235 240

Tyr Ser Ser Thr Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly
245 250 255

Ser Thr Ser Lys Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu
260 265 270

Lys Glu Leu Tyr Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp
275 280 285

Gly Leu Val Phe Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly
290 295 300

Val Ala Ile Pro His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys
305 310 315 320

Leu Ser Ala Leu Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly
325 330 335

Thr Gly Ser Thr Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser
340 345 350

Ser Leu Gly Ser Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys
355 360 365

Glu Leu Ser Ser Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile
370 375 380

Val Glu Glu Thr Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe
385 390 395 400

His Tyr Ile Pro Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn
405 410 415

Asn Ser Leu Ala Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr
420 425 430

Ser His Glu Lys His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg
435 440 445

Ile Ile Ala Glu Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His
450 455 460

Asn His Tyr Phe Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Val
465 470 475 480

Arg Lys Asn Ile



<210> 7 
<211> 1455 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 7 

atgaaattta gtaaaaaata tatagcagct ggatcagctg ttatcgtatc cttgagtcta 60
tgtgcctatg cactaaacca gcatcgttcg caggaaaata aggacaataa tcgtgtctct 120
tatgtggatg gcagccagtc aagtcagaaa agtgaaaact tgacaccaga ccaggttagc 180
cagaaagaag gaattcaggc tgagcaaatt gtaatcaaaa ttacagatca gggctatgta 240
acgtcacacg gtgaccacta tcattactat aatgggaaag ttccttatga tgccctcttt 300
agtgaagaac tcttgatgaa ggatccaaac tatcaactta aagacgctga tattgtcaat 360
gaagtcaagg gtggttatat catcaaggtc gatggaaaat attatgtcta cctgaaagat 420
gcagctcatg ctgataatgt tcgaactaaa gatgaaatca atcgtcaaaa acaagaacat 480
gtcaaagata atgagaaggt taactctaat gttgctgtag caaggtctca gggacgatat 540
acgacaaatg atggttatgt ctttaatcca gctgatatta tcgaagatac gggtaatgct 600
tatatcgttc ctcatggagg tcactatcac tacattccca aaagcgattt atctgctagt 660
gaattagcag cagctaaagc acatctggct ggaaaaaata tgcaaccgag tcagttaagc 720
tattcttcaa cagctagtga caataacacg caatctgtag caaaaggatc aactagcaag 780
ccagcaaata aatctgaaaa tctccagagt cttttgaagg aactctatga ttcacctagc 840
gcccaacgtt acagtgaatc agatggcctg gtctttgacc ctgctaagat tatcagtcgt 900
acaccaaatg gagttgcgat tccgcatggc gaccattacc actttattcc ttacagcaag 960
ctttctgcct tagaagaaaa gattgccaga atggtgccta tcagtggaac tggttctaca 1020
gtttctacaa atgcaaaacc taatgaagta gtgtctagtc taggcagtct ttcaagcaat 1080
ccttcttctt taacgacaag taaggagctc tcttcagcat ctgatggtta tatttttaat 1140
ccaaaagata tcgttgaaga aacggctaca gcttatattg taagacatgg tgatcatttc 1200
cattacattc caaaatcaaa tcaaattggg caaccgactc ttccaaacaa tagtctagca 1260
acaccttctc catctcttcc aatcaatcca ggaacttcac atgagaaaca tgaagaagat 1320
ggatacggat ttgatgctaa tcgtattatc gctgaagatg aatcaggttt tgtcatgagt 1380
cacggagacc acaatcatta tttcttcaag aaggacttga cagaagagca aattaaggtg 1440
cgcaaaaaca tttag 1455



<210> 8 
<211> 819 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 8 

Met Lys Ile Asn Lys Lys Tyr Leu Val Gly Ser Ala Ala Ala Leu Ile
1 5 10 15

Leu Ser Val Cys Ser Tyr Glu Leu Gly Leu Tyr Gln Ala Arg Thr Val
20 25 30

Lys Glu Asn Asn Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln
35 40 45

Lys Thr Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile
50 55 60

Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr
65 70 75 80

Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp
85 90 95

Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Lys Leu
100 105 110

Lys Asp Glu Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Val Ile Lys
115 120 125

Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp
130 135 140

Asn Val Arg Thr Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser
145 150 155 160

Gln His Arg Glu Gly Gly Thr Pro Arg Asn Asp Gly Ala Val Ala Leu
165 170 175

Ala Arg Ser Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn
180 185 190

Ala Ser Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His
195 200 205

Gly Asp His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu
210 215 220

Leu Ala Ala Ala Glu Ala Phe Leu Ser Gly Arg Gly Asn Leu Ser Asn
225 230 235 240

Ser Arg Thr Tyr Arg Arg Gln Asn Ser Asp Asn Thr Ser Arg Thr Asn
245 250 255

Trp Val Pro Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser
260 265 270

Asn Asn Ser Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp
275 280 285

Ser Leu Leu Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val
290 295 300

Glu Ser Asp Gly Leu Val Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
305 310 315 320

Ala Arg Gly Val Ala Val Pro His Gly Asp His Tyr His Phe Ile Pro
325 330 335

Tyr Ser Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro
340 345 350

Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
355 360 365

Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Gly Pro Gln Pro Ala
370 375 380

Pro Asn Leu Lys Ile Asp Ser Asn Ser Ser Leu Val Ser Gln Leu Val
385 390 395 400

Arg Lys Val Gly Glu Gly Tyr Val Phe Glu Glu Lys Gly Ile Ser Arg
405 410 415

Tyr Val Phe Ala Lys Asp Leu Pro Ser Glu Thr Val Lys Asn Leu Glu
420 425 430

Ser Lys Leu Ser Lys Gln Glu Ser Val Ser His Thr Leu Thr Ala Lys
435 440 445

Lys Glu Asn Val Ala Pro Arg Asp Gln Glu Phe Tyr Asp Lys Ala Tyr
450 455 460

Asn Leu Leu Thr Glu Ala His Lys Ala Leu Phe Glu Asn Lys Gly Arg
465 470 475 480

Asn Ser Asp Phe Gln Ala Leu Asp Lys Leu Leu Glu Arg Leu Asn Asp
485 490 495

Glu Ser Thr Asn Lys Glu Lys Leu Val Asp Asp Leu Leu Ala Phe Leu
500 505 510

Ala Pro Ile Thr His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile
515 520 525

Glu Tyr Thr Glu Asp Glu Val Arg Ile Ala Gln Leu Ala Asp Lys Tyr
530 535 540

Thr Thr Ser Asp Gly Tyr Ile Phe Asp Glu His Asp Ile Ile Ser Asp
545 550 555 560

Glu Gly Asp Ala Tyr Val Thr Pro His Met Gly His Ser His Trp Ile
565 570 575

Gly Lys Asp Ser Leu Ser Asp Lys Glu Lys Val Ala Ala Gln Ala Tyr
580 585 590

Thr Lys Glu Lys Gly Ile Leu Pro Pro Ser Pro Asp Ala Asp Val Lys
595 600 605

Ala Asn Pro Thr Gly Asp Ser Ala Ala Ala Ile Tyr Asn Arg Val Lys
610 615 620

Gly Glu Lys Arg Ile Pro Leu Val Arg Leu Pro Tyr Met Val Glu His
625 630 635 640

Thr Val Glu Val Lys Asn Gly Asn Leu Ile Ile Pro His Lys Asp His
645 650 655

Tyr His Asn Ile Lys Phe Ala Trp Phe Asp Asp His Thr Tyr Lys Ala
660 665 670

Pro Asn Gly Tyr Thr Leu Glu Asp Leu Phe Ala Thr Ile Lys Tyr Tyr
675 680 685

Val Glu His Pro Asp Glu Arg Pro His Ser Asn Asp Gly Trp Gly Asn
690 695 700

Ala Ser Glu His Val Leu Gly Lys Lys Asp His Ser Glu Asp Pro Asn
705 710 715 720

Lys Asn Phe Lys Ala Asp Glu Glu Pro Val Glu Glu Thr Pro Ala Glu
725 730 735

Pro Glu Val Pro Gln Val Glu Thr Glu Lys Val Glu Ala Gln Leu Lys
740 745 750

Glu Ala Glu Val Leu Leu Ala Lys Val Thr Asp Ser Ser Leu Lys Ala
755 760 765

Asn Ala Thr Glu Thr Leu Ala Gly Leu Arg Asn Asn Leu Thr Leu Gln
770 775 780

Ile Met Asp Asn Asn Ser Ile Met Ala Glu Ala Glu Lys Leu Leu Ala
785 790 795 800

Leu Leu Lys Gly Ser Asn Pro Ser Ser Val Ser Lys Glu Lys Ile Asn
805 810 815

Lys Leu Asn



<210> 9 
<211> 2451 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 9 

atgaaaatta ataagaaata ccttgttggt tctgcggcag ctttgatttt aagtgtttgt 60
tcttacgagt tgggactgta tcaagctaga acggttaagg aaaataatcg tgtttcctat 120
atagatggaa aacaagcgac gcaaaaaacg gagaatttga ctcctgatga ggttagcaag 180
cgtgaaggaa tcaatgctga gcaaatcgtc atcaagataa cagaccaagg ctatgtcact 240
tcacatggcg accactatca ttattacaat ggtaaggttc cttatgacgc tatcatcagt 300
gaagaattac tcatgaaaga tccaaactat aagctaaaag atgaggatat tgttaatgag 360
gtcaagggtg gatatgttat caaggtagat ggaaaatact atgtttacct taaggatgct 420
gcccacgcgg ataacgtccg tacaaaagag gaaatcaatc gacaaaaaca agagcatagt 480
caacatcgtg aaggtggaac tccaagaaac gatggtgctg ttgccttggc acgttcgcaa 540
ggacgctata ctacagatga tggttatatc tttaatgctt ctgatatcat agaggatact 600
ggtgatgctt atatcgttcc tcatggagat cattaccatt acattcctaa gaatgagtta 660
tcagctagcg agttggctgc tgcagaagcc ttcctatctg gtcgaggaaa tctgtcaaat 720
tcaagaacct atcgccgaca aaatagcgat aacacttcaa gaacaaactg ggtaccttct 780
gtaagcaatc caggaactac aaatactaac acaagcaaca acagcaacac taacagtcaa 840
gcaagtcaaa gtaatgacat tgatagtctc ttgaaacagc tctacaaact gcctttgagt 900
caacgacatg tagaatctga tggccttgtc tttgatccag cacaaatcac aagtcgaaca 960
gctagaggtg ttgcagtgcc acacggagat cattaccact tcatccctta ctctcaaatg 1020
tctgaattgg aagaacgaat cgctcgtatt attccccttc gttatcgttc aaaccattgg 1080
gtaccagatt caaggccaga acaaccaagt ccacaaccga ctccggaacc tagtccaggc 1140
ccgcaacctg caccaaatct taaaatagac tcaaattctt ctttggttag tcagctggta 1200
cgaaaagttg gggaaggata tgtattcgaa gaaaagggca tctctcgtta tgtctttgcg 1260
aaagatttac catctgaaac tgttaaaaat cttgaaagca agttatcaaa acaagagagt 1320
gtttcacaca ctttaactgc taaaaaagaa aatgttgctc ctcgtgacca agaattttat 1380
gataaagcat ataatctgtt aactgaggct cataaagcct tgtttgnaaa taagggtcgt 1440
aattctgatt tccaagcctt agacaaatta ttagaacgct tgaatgatga atcgactaat 1500
aaagaaaaat tggtagatga tttattggca ttcctagcac caattaccca tccagagcga 1560
cttggcaaac caaattctca aattgagtat actgaagacg aagttcgtat tgctcaatta 1620
gctgataagt atacaacgtc agatggttac atttttgatg aacatgatat aatcagtgat 1680
gaaggagatg catatgtaac gcctcatatg ggccatagtc actggattgg aaaagatagc 1740
ctttctgata aggaaaaagt tgcagctcaa gcctatacta aagaaaaagg tatcctacct 1800
ccatctccag acgcagatgt taaagcaaat ccaactggag atagtgcagc agctatttac 1860
aatcgtgtga aaggggaaaa acgaattcca ctcgttcgac ttccatatat ggttgagcat 1920
acagttgagg ttaaaaacgg taatttgatt attcctcata aggatcatta ccataatatt 1980
aaatttgctt ggtttgatga tcacacatac aaagctccaa atggctatac cttggaagat 2040
ttgtttgcga cgattaagta ctacgtagaa caccctgacg aacgtccaca ttctaatgat 2100
ggatggggca atgccagtga gcatgtgtta ggcaagaaag accacagtga agatccaaat 2160
aagaacttca aagcggatga agagccagta gaggaaacac ctgctgagcc agaagtccct 2220
caagtagaga ctgaaaaagt agaagcccaa ctcaaagaag cagaagtttt gcttgcgaaa 2280
gtaacggatt ctagtctgaa agccaatgca acagaaactc tagctggttt acgaaataat 2340
ttgactcttc aaattatgga taacaatagt atcatggcag aagcagaaaa attacttgcg 2400
ttgttaaaag gaagtaatcc ttcatctgta agtaaggaaa aaataaacta a 2451

<210> 10 
<211> 819 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 10 

Met Lys Ile Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Val Leu Ala
1 5 10 15

Leu Ser Val Cys Ser Tyr Glu Leu Gly Arg Tyr Gln Ala Gly Gln Asp
20 25 30

Lys Lys Glu Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly
35 40 45

Gln Lys Ala Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly
50 55 60

Ile Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95

Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110

Leu Lys Asp Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile
115 120 125

Lys Val Asn Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140

Asp Asn Ile Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu Arg
145 150 155 160

Ser His Asn His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg
165 170 175

Ala Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser
180 185 190

Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp
195 200 205

His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala
210 215 220

Ala Ala Glu Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser
225 230 235 240

Ser Ser Ser Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn
245 250 255

His Asn Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn
260 265 270

Ile Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg
275 280 285

His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser
290 295 300

Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe
305 310 315 320

Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile
325 330 335

Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro
340 345 350

Glu Glu Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln
355 360 365

Pro Ala Pro Ser Asn Pro Ile Asp Gly Lys Leu Val Lys Glu Ala Val
370 375 380

Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg
385 390 395 400

Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp
405 410 415

Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Thr Lys
420 425 430

Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr
435 440 445

Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg
450 455 460

Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp
465 470 475 480

Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp Ile Leu Ala Phe Leu
485 490 495

Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gln Ile
500 505 510

Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr
515 520 525

Thr Ala Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp
530 535 540

Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile
545 550 555 560

Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr
565 570 575

Ala Glu Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp Ser
580 585 590

Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys
595 600 605

Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr
610 615 620

Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His
625 630 635 640

Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala
645 650 655

Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr
660 665 670

Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn
675 680 685

Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala Asp Thr Asn
690 695 700

Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu Lys Pro Glu
705 710 715 720

Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys Pro Glu Ser
725 730 735

Pro Lys Pro Thr Glu Glu Pro Glu Glu Ser Pro Glu Glu Ser Glu Glu
740 745 750

Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg Glu Ala Glu
755 760 765

Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys Ser Asn Ala Lys
770 775 780

Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr Gln Asp
785 790 795 800

Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu Leu Lys
805 810 815

Glu Ser Lys



<210> 11 
<211> 2531 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 11 

atgaaaatta ataaaaaata tctagcaggt tcagtggcag tccttgccct aagtgtttgt 60
tcctatgagc ttggacgtta ccaagctggt caggataaga aagagtctaa tcgagttgct 120
tatatagatg gtgatcaggc tggtcaaaag gcagaaaact tgacaccaga tgaagtcagt 180
aagagggagg ggatcaacgc cgaacaaatt gttatcaaga ttacggatca aggttatgtg 240
acctctcatg gagaccatta tcattactat aatggcaagg ttccttatga tgccatcatc 300
agtgaagagc tcctcatgaa agatccgaat tatcagttga aggattcaga cattgtcaat 360
gaaatcaagg gtggttatgt cattaaggta aacggtaaat actatgttta ccttaaggat 420
gcrgctcatg cggataatat tcggacaaaa gaagagatta aacgtcagaa gcaggaacgc 480
agtcataatc ataactcaag agcagataat gctgttgctg cagccagagc ccaaggacgt 540
tatacaacgg atgatgggta tatcttcaat gcatctgata tcattgagga cacgggtgat 600
gcttatatcg ttcctcacgg cgaccattac cattacattc ctaagaatga gttatcagct 660
agcgagttag ctgctgcaga agcctattgg aatgggaagc agggatctcg tccttcttca 720
agttctagtt ataatgcaaa tccagctcaa ccaagattgt cagagaacca caatctgact 780
gtcactccaa cttatcatca aaatcaaggg gaaaacattt caagcctttt acgtgaattg 840
tatgctaaac ccttatcaga acgccatgtg gaatctgatg gccttatttt cgacccagcg 900
caaatcacaa gtcgaaccgc cagaggtgta gctgtccctc atggtaacca ttaccacttt 960
atcccttatg aacaaatgtc tgaattggaa aaacgaattg ctcgtattat tccccttcgt 1020
tatcgttcaa accattgggt accagattca agaccagaag aaccaagtcc acaaccgact 1080
ccagaaccta gtccaagtcc gcaaccagct ccaagcaatc caattgatgg gaaattggtc 1140
aaagaagctg ttcgaaaagt aggcgatggt tatgtctttg aggagaatgg agtttctcgt 1200
tatatcccag ccaaggatct ttcagcagaa acagcagcag gcattgatag caaactggcc 1260
aagcaggaaa gtttatctca taagctagga actaagaaaa ctgacctccc atctagtgat 1320
cgagaatttt acaataaggc ttatgactta ctagcaagaa ttcaccaaga tttacttgat 1380
aataaaggtc gacaagttga ttttgaggct ttggataacc tgttggaacg actcaaggat 1440
gtctcaagtg ataaagtcaa gttagtggaa gatattcttg ccttcttagc tccgattcgt 1500
catccagaac gtttaggaaa accaaatgcg caaattacct acactgatga tgagattcaa 1560
gtagccaagt tggcaggcaa gtacacagca gaagacggtt atatctttga tcctcgtgat 1620
ataaccagtg atgaggggga tgcctatgta actccacata tgacccatag ccactggatt 1680
aaaaaagata gtttgtctga agctgagaga gcggcagccc aggcttatgc traagagaaa 1740
ggtttgaccc ctccttcgac agaccatcag gattcaggaa atactgaggc aaaaggagca 1800
gaagctatct acaaccgmgt gaaagcagct aagaaggtgc cacttgatcg tatgccttac 1860
aatcttcaat atactgtaga agtcaaaaac ggtagtttaa tcatacctca ttatgaccat 1920
taccataaca tcaaatttga gtggtttgac gaaggccttt atgaggcacc taaggggtat 1980
actcttgagg atcttttggc gactgtcaag tactatgtcg aacatccaaa cgaacgtccg 2040
cattcagata atggttttgg taacgctagc gaccatgttc aaagaaacaa aaatggtcaa 2100
gctgatacca atcaaacgga aaaaccaagc gaggagaaac ctcagacaga aaaacctgag 2160
gaagaaaccc ctcgagaaga gaaaccgcaa agcgagaaac cagagtctcc aaaaccaaca 2220
gaggaaccag aagaatcacc agaggaatca gaagaacctc aggtcgagac tgaaaaggtt 2280
gaagaaaaac tgagagaggc tgaagattta cttggaaaaa tccaggatcc aattatcaag 2340
tccaatgcca aagagactct cacaggatta aaaaataatt tactatttgg cacccaggac 2400
aacaatacta ttatggcaga agctgaaaaa ctattggctt tattaaagga gagtaagtaa 2460
aggtagaagc ttaagggcga atttggcacc caggacaaca atactattat ggcagaagct 2520
gaaaaactat t 2531
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